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We consider a nonlinear polynomial regression model in which we wish to test the null hypothesis 
of structural stability in the regression parameters against the alternative of a break at an 
unknown time. We derive the extreme value distribution of a maximum-type test statistic which 
is asymptotically equivalent to the maximally selected likelihood ratio. The resulting test is easy 
to apply and has good size and power, even in small samples. 
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1. Introduction 

Testing for structural changes has become a focus of attention in statistics and econo- 
metrics, reflected in the broadening of possible settings under consideration and the 
multitude of test statistics developed to investigate them. For recent references, see Du- 
four and Ghyscls (1996) and Banerjee and Urga (2005). In the present paper, we focus 
on the (nonlinear) polynomial regression model 

yi=x[(3^ + ei, i=l,...,n, (1.1) 

where {e,} is an error sequence that will be specified below, {(3^} are (p+ l)-dimensional 
deterministic vectors and 

K, = {l,i/n,...,{i/nrf. 
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In this setting, we are interested in testing the null hypothesis of structural stability 
against the alternative of a regime switch at an unknown time, that is. 

Ha : (3, = (3^, i^l,...,n; 
Ha ■ There is a fc* > 1 such that 

(^i = /^Oi j = 1, . . . , /c , 

i = k* + l,...,n, 

with /3q 7^ /3^. The parameters before and after the change, /3q and /3^, as well as the 
change-point, k* , arc unknown. 

Pioneering steps to analyze the structural stability for linear models were carried out 
by Quandt (1958, 1960). His contributions were subsequently refined and extended by, 
for example, Hawkins (1989), Andrews (1993), Horvath (1995) and Bai et al. (1998). 
For comprehensive reviews, we refer to Csorgo and Horvath (1997) and Perron (2006). 
While there is an extensive body of literature available for the asymptotic theory in 
linear models, one of the key assumptions in this setting requires the regressors {x^} to 
be stationary or to satisfy regularity conditions such as 

— Xixf ^ Cr, for all r G (0, 1), asn^oo, (1-2) 

where C is a (p + 1) x (p + l)-dimensional positive definite matrix and [-J denotes the 
integer part. Thus, limit results obtained for linear models do not directly apply to the 
polynomial regression addressed here. 

Groundbreaking work for polynomial regression models is due to MacNeill (1978), who 
derived, under Hq, the asymptotic distribution of partial sums obtained from regression 
residuals. In the same paper, he also proposed two test statistics to distinguish between 
stability and regime switches, and studied linear and quadratic functionals of residual 
partial sums. MacNcill's (1978) results assume independent, identically distributed errors 
{si}. For further work in this setting, see Jandhyala (1993) and Jandhyala and MacNeill 
(1989, 1997). The assumptions on the error sequence were relaxed by Kuang (1998) who 
requires {si} to obey a functional central limit theorem. He introduced a generalized 
fluctuation test based on comparions of recursive estimates with benchmark estimates 
by mimicking the test procedure of Kuang and Hornik (1995). The new test statistic 
is compared to MacNcill's (1978) versions in an extensive simulation study. Huskova 
and Picek (2005) considered bootstrap methods for more general settings, which include 
polynomial regression models as a special case. Here, the regresssion function is assumed 
to be smooth. Our work, however, is more akin to the contributions of Jaruskova (1998, 
1999) and Albin and Jaruskova (2003). In the case p=l, these papers obtain extreme 
value asymptotics for a test statistic designed to find changes in the regression line by 
explicitly computing the design matrices appearing in (1.2). 
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The paper is organized as follows. In Section 2, we precisely state assumptions on the 
innovations {si}, introduce a maximum- type test statistic, formulate the main extreme 
value asymptotic and motivate the proofs which are given in Sections 4 and 5. Section 3 
is devoted to the practical application of our theory. 



2. Model assumptions and results 

In what follows, we study the polynomial regression model as specified in (1.1), in which 
we wish to distinguish between the structural stability null hypothesis Hq and the break- 
point alternative Ha- Assuming for the moment that the error sequence consists of 
independent, identically distributed normal variables with the same known variance cr^, 
and that the time of change is k* = k, twice the logarithm of the likelihood ratio for this 
two-sample problem is given by 

in{k)=na-'{al-[al,+al,]), (2.1) 

where 

k n 

ncrls^Y^{Ui-y^Jhkf and ncr\^^ ^ {y,-^Jfy'^ 

i=l ■i=fe+l 

denote the sum of the squared residuals for the first, respectively, second sample, and 

n 

1=1 

the sum of the squared residuals for the whole sample. Therein, /3^. and /Jj. are the least 
squares estimators for /3 based on the first k and the last n — k observations. Elementary 
algebra implies that, under iJo, we have 



where 



and 



1=1 i=k+l 



T 



Sk = ^x,;y, - CfeC,/ ^Xiy, = ^Xj(2/i - xf /3„). 

z=l 2=1 i=l 

Since, in general, the time of change k* is unknown, we reject the null hypothesis Hq for 
large values of 

Tn^\ max SlC^'CnC^'Sk. (2.2) 

(7^ p<k<n-p 
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Note that, as a straightforward computation shows, the value of T„ does not change if 
X,; = (1, z/n, . . . , {i/nY)'^ is replaced by x* = (1, z, . . . , i^y . 

Still assuming that the errors are independent, identically normal, but also that the 
common variance cr^ is unknown, the resulting likelihood ratio is 

in the case where the time of change is exactly k ~ k* (see Andrews (1993) and Csorgo 
and Horvath (1997), Section 3.1.1). If k* is unknown, the stability of the regression 
coefficients is rejected for large values of 

f„= max [-21ogi„(fc)]. (2.3) 

p<k<_n—p 

Hansen (2000) studied the asymptotic distribution of the trimmed version 

Tn,s = -r^ rnax Cr^C„Cr^SA; 

with some < S < I. The limit distribution of Tn^s is the supremum of quadratic forms 
of {p + l)-dimensional Gaussian processes with a complicated covariance structure. To 
state the limit theorem for the truncated statistic T„ .5, we introduce the matrices 



C(f)= x'-^^ dx:0<i,j <p 



C(f)= (^J^ x'+^ dx:0<i,j <p 



and 



Furthermore, for t > 0, let T{t) = (/^ x' dW{x) : < i < p), where {W{t) : t > 0} denotes a 
standard Brownian motion. The proof of the following theorem is due to Hansen (2000). 



Theorem 2.1. // (1.1) and (2.2)-(2.6) hold, then, under Hq, 

Tn,5^ sup A^(t)C-i(OC(l)C(t)A(t) 

S<t<l~S 

for all 5 e (0, 1/2), where A{t) = T{t) - C{t)C-^{l)T{l) and t > 0. 



The use of the truncated statistic T„ .5 requires choosing S and so a priori excludes 
changes close to the end-points of the sample. More importantly, the computation of 
the asymptotic critical values for Tn^s is a non-trivial numerical task. A practical use of 
Theorem 2.1 would require tables for a range of values of S and p, and we are not aware 
of any such tables. 
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In contrast, the definition of T„ allows, at least in principle, to detect changes anywhere 
in the sample, and asymptotic critical values can be easily computed using Theorem 
2.2 below. The theory needed to establish Theorem 2.2 is, however, far from trivial. 
We are following previous work of Jaruskova (1998, 1999) and Albin and Jaruskova 
(2003). These contributions utilize the theory of high level exceedence probabilities for 
stationary Gaussian processes developed in Albin (1990, 1992, 2001) to obtain extreme 
value asymptotics. For classical results in the field, sec Leadbctter et al. (1983); for a 
more recent survey on extreme value theory with applications in telecommunication and 
the environment, see Finkcnstacdt and Rootzcn (2003). 

While, for motivational reasons, all test statistics in this section have been introduced 
and explained for normal error sequences, the resulting test procedures are sensitive with 
respect to changes in the regression parameters in a much more general setting. We 
assume that the errors have constant variance and arc uncorrclated, that is. 



E£i = 0, Ee^^cr^ and Ee, 







(2.4) 



and that there are two independent standard Brownian motions (standard Wiener pro- 
cesses) {Wi,n(s) : s > 0} and {W2.n{s) ■s>0} such that 



1 



max 



l<fc<n/2 



and 



max 



n/2<k<n (n — k)^/ 



2-A 



k 



i=k+l 



aWi,n{k) =Op{l) {n~*oo) (2.5) 



aW2^n{n-k) ^Op{l) (n-^oo) (2.6) 



with some A > 0. 

Sequences {si} satisfying the invariance principles (2.5) and (2.6) include many weakly 
dependent processes including GARCH-type sequences (Aue et al. (2006)), mixing se- 
quences (Shao (1993)) and martingale differences (Eberlein (1986)). Note, also, that the 
assumptions on the error sequence could be further relaxed along the lines of Qu and 
Perron (2007). 

Let T{t) = e^^y*^^ dy denote the Gamma function. The main results are the fol- 
lowing limit theorems which establish the asymptotic behavior of T„ , Tn and Tn^s- 



Theorem 2.2. // (1.1) and (2.4)-(2.6) hold, then, under Hq, the statistic Tn (2.2) 
satisfies, for all x. 



lim PlTn<x + 2 log log n + [p + l) log log log n — 2 log ( 
= exp(-2e-"/2). 



/2(p+i)/2r((p+i)/2) 



p+l 



The following corollary to Theorem 2.2 is useful in its practical application. 
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Corollary 2.1. If (1.1) and (2.4)^(2.6) hold, then, under Ha, the statistic f„ (2.3) 
satisfies, for all x, 

lim P{ Tn < X + 21oglog/i(7i) + {p + l)logloglog/i(7i) - 21og '-^^ '-^ 

= exp(-2e-"/2)^ 

where h(n) ^n([ogn)'^ with an arbitrary real 7. 

Instead of the statistics T„ (2-3), the foUowing statistics can be used: 



r„,i = i max S^C-^C^C-^S, 



(7^ p<k<n—p ^ ^ 

Tn,2 = max ■ — Tn — ^fc'^fc ^"^fe 

p<k<n~p ai. -^ + (7 2 

T„_3=( min [o-fc^i +6-^,2] I max SlC^^C^C-^Sk. 

yp<fe<ri— p y p<k<n—p 

Indeed, using the Taylor expansion of log(l + x), —1 < a: < 1, it is easily verified that, 
under TJq, 

|T„-r„,,HOpQ^T2„ i = l,2,3. 

The statistics T„.i, T„_2 and Tn.3 differ only in the built-in estimation of the variance 
parameter . Under the assumptions of Theorem 2.2 and further mild regularity condi- 
tions. 



1 



log log n 



and 

max lo'fc 1 + (Tt 2 ~ o'^l = Op f 5!— 

p<k<7i-p' '^'^ '"^ V log log"'/ 

as n— * 00. Hence, the results of Theorem 2.1, Theorem 2.2 and Corollary 2.1 also apply 
to the test statistics r„,i, Tn.2 and T„^3. 

The proof of Theorem 2.2 and its corollary is prepared in Section 4 and completed 
in Section 5. In Section 4, we show that the error sequence can be replaced with in- 
dependent, identically distributed normal random variables. It will be shown that the 
limit distribution of T„ is related to the maximum of the sum of squares obtained from 
continuous-time Gaussian processes which are integrals of Legendre polynomials with re- 
spect to the same Brownian motion. The extreme value asymptotics can then be derived 
utilizing a result of Aue et al. (2007). Next, however, we discuss the practical application 
of our theory. 
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3. Application to finite samples 

The goal of this section is twofold: to explain the application of the test procedure and 
to give an idea of its performance in finite samples. 

By Corollary 2.1, the test rejects the null hypothesis at level a if > c{n,a), where 
the critical value c{n, a) is computed via 

c{n,a) = -21og(-0.51og(l - a)) + g{n,p,-f), 

where, setting h{n) = n{\ogn)^ , 

'2(P+i)/2r((p+l)/2) 



j{n,p,j) = 21oglog/i(n) + (p + 1) logloglog/i(n) - 21og 



p+l 



The symbol log stands for the natural logarithm, p is the order of the polynomial regres- 
sion to be fitted and 7 is a real constant used to calibrate the size (see below). 

The statistic T„ is easy to compute in any statistical software package. Denote by 
Sn, Sk,i and Sk^2 the residual standard deviations obtained from fitting a polynomial 
regression to, respectively, the whole data set, the first k observations and the last n — k 
observations. Then, after some simple algebra, we obtain 



Tn 



min {log((fc - p)sl i + (n ~ k - p)sl 2)} - log(n - p) - 2 log(s„) 

p<.k<.n—p 



(3.1) 



We recommend reducing the range over which the minimum is taken by one data point 
on each side, that is, to use the minimum over k=p + 2, ...,n — p — 2. An implementation 
in R (for p — 2) is displayed in Figure 1. 

The test described above has good size, even in small samples. We considered the linear 
regression j/i = /3o + f^iXi + Si and the quadratic regression yi = f3o + PiXi + f32xf + £i with 
i.i.d. standard normal errors e^. In both cases, we set Pq = 1. For the linear regression, we 
varied the slope f3i. For the quadratic regression, we set Pi = and varied the coefficient 
P2 of the quadratic term. The empirical sizes based on one thousand replications are 
displayed in Table 1 - they are within two standard errors of the nominal sizes. Note 
that for the linear regression we used 7 = 0, as suggested by our main Theorem 2.2, but 
for the quadratic regression, we used 7=1. If 7 = is used for p~2, the rejection rates 
exceed the nominal size by a few percentage points for the samples sizes we considered. 

To assess the power, we used the same general setting as for the size study and con- 
sidered the following change-point models: 



p = 


l:(3o 


= 1, 


f3i 


= 1 


fori<fc*; 






Po = 


0, 




Pi 


= 


for i> k*; 






p = 


2:/3o 


= 1, 


Pi 


= 0, 


P2^2 


for 


i < k* 


Po = 


0, 




Pi 


= 0, 


P2 = 


for 


i>k* 
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Table 1. Empirical size (in percent) of the test derived from Corollary 2.1 



p = l(7 = 0) p = 2(7 = l) 





Pi = 


0.5 


/3i = 


1.0 


/3i = 


2.0 


P2 = 


0.5 


/32 = 


1.0 


P2 = 


2.0 


n 


10% 


5% 


10% 


5% 


10% 


5% 


10% 


5% 


10% 


5% 


10% 


5% 


50 


10.3 


5.3 


10.2 


6.4 


09.5 


5.4 


10.9 


6.7 


11.7 


7.0 


09.4 


5.1 


100 


10.2 


5.2 


10.0 


5.7 


11.4 


6.4 


09.5 


5.9 


08.7 


5.8 


10.0 


5.9 


200 


11.7 


6.3 


11.4 


6.0 


09.3 


6.2 


10.9 


5.3 


10.2 


5.6 


10.4 


6.6 



We considered k* = n/2 and k* = njh. 

Even though these changes appear large, one must keep in mind that they arc, in fact, 
small relative to the standard deviation of the errors. Examples of scatterplots of the 
simulated data are shown in Figures 2 and 3. It is seen that in many cases, a change- 
point is not apparent without prior information on its existence. For example, in Figure 2, 
n = 200, k* = 40, the data appears to fit a linear regression with a negative slope. A visual 
examination of the remaining panels reveals that the changes we consider arc difficult 
to identify by eye. Nevertheless, Table 2 shows that these changes can be detected with 
non-trivial power. As for most change-point detection procedures, it is easier to detect a 
change in the middle of the sample. 

In summary, the results of this section show that our procedure is very easy to use in 
practice and gives good results, even in small samples. 

hT2=f unction (Y) #p=2, Y - responses 
{ 

n=length(Y); Xl=(l:n)/n; X2= X1*X1 
v=rep(-l, n) 
f or(k in 4: (n-4) ) 
{ 

lmkl=lm (Y [1 : k] ""Xl [1 : k] +X2 [1 : k] ) 

link2=lm(Y[(k+l) :n]~Xl [(k+1) :n] +X2[(k+1) :n] ) 
skl=suinmary ( Imkl) $s igma 
sk2=summary (lmk2) $sigma 
V [k] = (k-2) *skl*skl+ (n-k-2) *sk2*sk2 

} 

v=v[4: (n-4)] 

lmA=lm(Y X1+X2) ; sA=summary(lmA)$sigma 
-n*(min(log(v)) - log(n-2) - 2*log(sA)) 
> 

Figure 1. R code for computing the statistic Tn (2.3) for p — 2. 
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10 



20 



30 



40 



50 



n=100, k*=50 




o o 



n=100, k*=20 



o o 

oo 



o 

o 

m 
o 



" o„ o o o o 



cP 
o , 



o o 

O O o 
o 



20 



40 



60 



80 



100 



n=200, k*=100 




n=200, k*=40 



o 



<S>. 



O o 



o 



"~r~ 
50 



100 



150 



200 



Figure 2. Scatterplots for linear regressions with change-points. 



4. Asymptotic representation of the test statistic 

The aim of this section is to derive a more convenient version of the test statistic T„ , which 
includes (a) a transformation that uses only the range of those time-lags k contributing 
to the extreme value asymptotic, and (b) a continuous-time modification which involves 
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Figure 3. Scatterplots for quadratic regressions witii change-points. 



the integration of normahzed Legendre polynomials with respect to standard Brownian 
motion. Throughout, wc work under the null hypothesis Hq. 

The following auxiliary lemma will prove useful. It is a consequence of the approxima- 
tions assumed in (2.5) and (2.6). 
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Table 2. Empirical power (in percent) of the test derived from Corollary 2.1 







p = 


1 (7 = 0) 






P = 2 (7 


= 1) 




k* =n/2 




k* — n/5 




fc* =n/2 




k* = n/5 




n 


10% 


5% 


10% 


5% 


10% 


5% 


10% 


5% 


50 


45.4 


34.1 


40.1 


29.0 


36.5 


28.5 


16.6 


10.2 


100 


80.5 


71.6 


67.1 


56.6 


68.0 


57.9 


24.2 


17.1 


200 


99.2 


98.7 


94.3 


91.4 


96.1 


94.2 


46.4 


35.1 


400 


100 


100 


100 


98.2 


100 


100 


78.7 


71.0 



Lemma 4.1. Under assumptions (2.5) and (2.6), for all < i < p 

L*J 



l<t<n/2 t ' 



n/2<t<n-l (^ — t) ^ I 

as n—foo. Also, for each n, 



LI J pt 



Op{l) 



(4.1) 
(4.2) 



x'AWi,n{x):s>Q\ = { W 



x'^AW2,n{x):0<s<n-l \ = \W 



2i + l 

(n-5)^'+i 
2i + l 



s>0 , 



:0 < s < n- 1 



(4.3) 
(4.4) 



Proof. Note that, by the modulus of continuity of a Brownian motion (see Csorgo and 
Revesz (1981), Theorem 1.21), condition (2.5) can be rewritten as 



sup 

l<t<n/2 



1 



tl/2-A 



^e,-aW^i,„(<) 
i=i 



:Op(l) (n-*oo). 



Applying integration by parts hence yields (4.1). The proof of (4.2) follows in a similar 
fashion from the approximation in (2.6). 

The statements given in (4.3) and (4.4) follow from computing the corresponding 
covariance functions. This suffices since the integral processes in (4.1) and (4.2) are 
Gaussian with mean zero. □ 

Observe that, by definition, C„ = + C^. Therefore, 
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with the two terms on the right-hand side being the subject of study in the following. Let 
II • II denote the maximum norm of both vectors and matrices. We obtain the following 
orders of magnitude for the matrices Cfe and the inverse matrices C^^. 



Lemma 4.2. As oo, 

1 , 



max 

l<fc<71 



Oil) and max ||C 

l<fe<n/2 



O 



(4.5) 



Proof. The first statement in (4.5) follows directly from the definition of Cfc. For the 
second statement, elementary approximations of sums with integrals imply 



sup 

0<t<l/2 



— C|„t| — C; 



o{l) (n^oo), 



where, for t e [0, 1/2], the matrix Ct — {Ct{i,j) '-0 < i,j < p} is defined by 



Ct{i,j)= / x'+'dx 



1 



By definition, Ct is continuous on [0, 1/2]. Observe, moreover, that Ct is the covariance 
matrix of the random variables x'^dW{x), 0<i<p, where {W^(s) :s > 0} denotes a 
standard Brownian motion. Since these variables arc linearly independent, Ct is non- 
singular for any t € [0, 1/2]. Hence, the proof is complete. □ 

Next, observe that the vector Sfc can be decomposed into a partial sum vector of e^'s, 
each of the latter random variables weighted with the corresponding polynomial regressor 
X,; (1 < j < fc) and a second term consisting of C^ and the difference /3„ — /3, that is. 



Sfc=Vfe-Cfe(^„-/3), vfe = ^ 



^7' ^ i 



(4.6) 



To investigate the contributing range of indices fc, let a, /? > and define 



a{n) = log" n and h{n) 



{n>l). 



We obtain the following lemma which shows that the term S^Cj, ^Sfc is asymptotically 
negligible. 



Lemma 4.3. // (2.5) and (2.6) hold, then, as 00, 

max S^C,-iS,=Op(l), 

1 < A; < n / 2 

max Sl'C^T^Sfc =C'p(l)log 

a(n) <fc<b(n) 



(4.7) 
(4.8) 
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Proof. By Lemma 4.2 and (4.6), as n ^ oo, 



max STC7^Sk = Opi -] max IISj"^ 



l<k<n/2 \n ) l<k<n/2 

= Op(-\( max ||vfc|p + ||;3„-/3||2 max ||Cfe| 

\n J \l<fe<n/2 l<k<n/2 

Now, the central limit theorem applied to the sequence {/3„ — /3} implies that 

||V^0„-/3)i|=Op(l) (n^oo). (4.9) 

Hence, by Lemma 4.2, 

p„-/3f max ||Cfc||2=Op(n) {n^^). 

l<k<n/2 

On using Lemma 4.1, we get (see the proof of (4.8) below and proceed in the same way) 

max llvfcll^ = C'p(n) (n^oo), 

l<k<n/2 

completing the proof of (4.7). 
Similarly, by Lemma 4.2, 

max SlC^^Sk^Op(-]( max \\vkf + Op(-] max \\Ck\\^ 

l<k<b{n) \n J \l<k<b{n) \n J l<k<b{n) 



= Op 

Applying Lemma 4.1 yields that 



(-\ max \\wkf + Op{\og-^Pn). 

\n J l<k<b{n) 



max 1 1 Vfe 1 1 =0" max max — - 

l<fc<b(ra) 0<i<p l<fe<f)(n) 



k 



+ C'p(l) max max n-'(b(n)y^^+'-'^ , 

0<t<p l<k<b{n) 

where, inserting the definition of b{n), the second term can be estimated by 

, / \ l/2+i-A / X 1/2- A 

1 / n \ n ^ 



max — 5 — < 

o<»<pn* Vlog^n/ Vlog'^n 



Since, for all < i < p. 



1 

max 



(6(n))'+i/2 o<t<b(«) 



V 1 

= . sup 
\/2i + 1 Q<t<i 



x'dW{x) 
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we get that 



1 

max max — r 

0<i<pO<t<&(n) 



, / X i+1/2 , X 1/2 



:Op(l) max max — — g— =Op(l) 



o<i<p o<t<6()i) -n} \ logP n / V log*^ n , 

completing the proof of (4.8). □ 

It remains to examine the term S^C^^Sfc. Applying the expression for obtained in 
(4.6) leads to the refined decomposition 

Si^C^T^S,. = {^l (;9„ - /3)^Cfe)C-i(v, - Cfe(^„ - /3)) 

= vi^C^Vfc - 2(4, - /3)^vfc + (^„ - /3)^Cfc(^„ - /3). 

Orders of magnitude of quantities appearing on the right-hand side of the latter equation 
array are provided in the following lemma. We restrict the discussion to the range fc < n/2, 
the other half of the observations, for which fc > n/2, will be dealt with after Lemma 
4.12. repeating the arguments developed now. 

Lemma 4.4. // (2.5) and (2.6) hold, then, as oo, 

max (4, - /3)^Cfc(^„ - /3) - Op(l), (4.10) 

l<fc<n/2 

max (A„-/3)^Cfc(A„-/3)=Op(l)log-''n, (4.11) 

l<k<b{n) 

max (4,-/3)^Vfc=Op(l), (4.12) 

l<fc<n/2 

max (4,-/3)'^Vfe =C'p(l)log"''/^n. (4.13) 

l<k<b{n) 

Proof. Using Lemma 4.2 and (4.9), we conclude that statement (4.10) holds. For (4.11), 
it can be similarly obtained that 

max (^^- f3fCki^,- f3)=Op(^-^) (n-.oo). 

l<k<b{n) \ n J 

Lemma 4.2 implies that, as n — > oo, 

max jlvfcll =C'p(l) and JL- max |lvfc|| = C'p(l). 
y/n l<k<n/2 yK"^) !<'-•<''(") 

Therefore, statements (4.12) and (4.13) follow from Lemma 4.1 and (4.9). □ 
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Next, we consider the maximum of S^C^T^Sfc on three different ranges determined by 
a{n) and b(n). To this end. we write 

max S^C^^Sfc = max{z„.i,z„_2,-2n,3} 

l<k<n/2 



with 



and 



z„ 1 = max S^C,/Sfc, z„,2 = max S^Cj^^Sfc 

l<k<a{n) a{n)<k<b{n) 



2„,3= max SfcC.. ^Sfc. 

6(n)<fc<n/2 

Lemma 4.5 provides approximations of Zn,ij Zn^2 and z„.3 using the partial sum vectors 
Vfc introduced in (4.6). It turns out that the maximum on all three ranges is completely 
determined by the maximum of the quadratic form v^C^^v^. 



Lemma 4.5. // (2.5) and (2.6) hold, then, as oo, 

max " 

l<A:<a(Ti) 



z„,i= max v^Cj^^ Vfc +Op(l), 



z„,2 = max VfcC^^VA; +e'p(log '''/^ri), 

a{7i)<k<b{n) 

z„,3 = max VfeCjTVfc +Op(l). 

b(n)<k<n/2 " 

Proof. All three approximations follow immediately from the foregoing Lemma 4.4. □ 
Introducing the notation 

Sfe = (sfe,o,---,Sfc,p), Sk,i = ^fej, 

Bn^{Bn{t,.]):0<i,j<p}, 
Bn{i,i)^n-\ B^{i,j) = (i^j), 

we obtain, for all 1 < ^ < J^, 

Vfc C-^Vfe = sjB„C-iB„s, = sjD-isfc. 

The diagonal matrix B„ simplifies the partial sums vj, by taking out the norming factor 
n~\ returning the partial sums which are used to rewrite the quadratic form, as done 
in the last display above. Note that the vectors possess covariance matrix cr^Dfc. 



(4.14) 
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We now transform into a new vector which has uncorrelated components via an 
apphcation of the Gram-Schmidt orthonormalization. To define = {sk,0: ■ ■ ■ i Sk,p)'^ , let 
Sk,o = Sk,o and set, recursively, for 1 <i <p, 

Ska = Ska — ak,i,OSkM — ak,ia~lSka~l- (4-15) 

The coefficients ak.ij are given as projections of Sk^i onto the new variables Sk,j (j < i), 
that is, 

Esk,iSk.j n / ■ ^ • / 

Consequently, Esk^iSkj = for < i j < p- According to Rao (1973), the matrices 
(7~^D^ are related to the Gram-Schmidt orthonormalization in (4.15) via the equality 

sla-'B^'/' = ( , , . . . , ^ ) . (4.16) 

\y/YaiSk,o ^/Ya.vsk,pJ 

Next, we study moments of Sks and Sk^i - Note that the precise limits given in the following 
lemma will be specified in Lemma 4.10 below. 

Lemma 4.6. // assumption (2.4) is satisfied, then, for all 0<i<p, 



Furthermore, for all < j < i < p. 



Proof. Since, by assumption (2.4) on the errors {£i}, 



£=1 



(4.17) follows after elementary calculations. Relations (4.18)-(4.20) can be established 
using mathematical induction and the definition of the coefficients ak^ij ■ Since the precise 
limits are obtained below, we only show here that, in (4.18), the limit Pi > 0. To this 
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end, writing 

k 

Sk,i = ^{(^' -ak,i,o -C(k,i,ii ak,i,i-it^^)ei, 

assumption (2.4) yields that 

k 

ESfc,,; = (j'^Y^i^' - ak,uO - akaA^ aka,r-l('~^)^ ■ 

Therefore, 

hm T^xf = / (t' - oiifi - ai,it ~^)^ dt 

fe— ►oo fc^'+J^ Jq 

so that Pi > 0, since the integrand is a non-negative, non-constant polynomial. □ 
Lemma 4.7. // (2.^)~(2.6) hold, then, as n^oo, 

max s^D^isfe = Op(logloglog7i), (4.21) 

l<k<a{n) 

max s^D^^Sfc = C'p(logloglog7i) (4.22) 

6(n)<fc<n/2 



2:„,i = C'p(logloglogn), (4.23) 
2:n,3 = Op(logloglogn). (4.24) 

Proof. According to Lemma 4.5, it is enough to prove (4.21) and (4.22). Using Lemma 
4.1 and the Darling-Erdos law for Brownian motions (cf. Csorgo and Revesz (1981)), we 
get, for all < i < p, 



1 



l<fe<a(n) 



Op(v/logloga(n)) (n->cxD). 



1=1 

Hence, (4.22) follows from (4.15), (4.16) and Lemma 4.6. □ 
(4.23) and (4.24) imply that, for any c> 0, 

P 

max{z„_i, Zn^a} — cloglogn — > — oo {n oo) 



so that it suffices to consider the asymptotics for the remaining term z„ 2- In what follows. 



we are going to approximate s^D. ^s^ with a quadratic form consisting of normal random 



654 



A. Aue et al. 



vectors. To this end, set iV,; = Ni^n = Wi^n{i) — Wi^n{i — 1), where {W„^i(s) : s > 0} is the 
Brownian motion defined in assumption (2.5). Let 



k 



i=i 



Lemma 4.8. // (2.4)^(2.6) hold, then, as n^oo, 

i , A^k^^k-- max n^D-^rifc + Opilog-"'^ ny^loglogn). 

O a{n)<k<h{n} a{n)<k<b{n) 

Proof. For fc > 1, set hkfl = n^.o and define, recursively, for I <i <p, 

nk,i = nk,i - otkAfifikfi akA,i-ink,t-i- (4-25) 

Following the arguments leading to (4.15) and (4.16), we get that 

nfeDfe =^=,---,^= ■ (4.26) 
From the proof of Lemma 4.1, we obtain, as n — > oo, 

k k 



1 

max 



1=1 1=1 



= Op{\). (4.27) 



l<k<n/2 fc*+l/2-A 

Combining (4.15), (4.16) and (4.26) with Lemma 4.6 leads to 

max ||(a-is^-nDD,^/^||=Op(log-"^n). 

a{n)<k<n/2 

By the Darling-Erdos law for the Wiener process and Lemma 4.1, wc get 

max \\a~^slT>-"^\\^Op{^/\^g^i) = max \\cj-^nlT>^"\ 

l<k<n/2 l<k<n/2 

Therefore, the lemma is proved. □ 
Next, we replace the discrete normal partial sums rik^i with continuous integrals. Define 

qn(0 = (9«.o(i),---,9n,p(i)), qn.iit)= x'dWi^n{x), 

Jo 

with {Wi^n{s) : s > 0} being the Brownian motion defined in (2.5). 
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max \qn,iit) -nk,t\^Op{k'^/k)gk). 

k<t<k-\-l 

Proof. Using the modulus of continuity of a Brownian motion (cf. Csorgo and Rcvesz 
(1981), Theorem 1.2.1) and integration by parts, we obtain that 



max 

k<t<k+l 



xM(W^i,„(x)-W^i,„(La:J)) 



< sup f|l^i,„(t)-T^i,„(LiJ)| + 
fc<t<fe+i 



sup 

fe<t<fc+l 



ia;'-i(W^i,„(a;)-VFi,„(LxJ))dx 



:Op(fc'ybifc)+Op(l) / x'-Vlog(NI+2)dx 



proving the result. 



□ 



Copying the arguments leading to (4.15), we define (jnflit) = qn.o{t) and, recursively, 
for 1 <i <p, 

qn,t{t) = qn,t - aifi{t)qnfi ai,i-i{t)qi^t-i{t) . 

Observe that since the distribution of {Wi,n{s) : s > 0} is independent of n, so are the 
coeffcients aij{t). By construction, Fiqn,i{t)qnj{t) = for <i^ j < p because, clearly. 



^qn.%{t)qn,j{t) 



<j <i<P- 



Egt,(i) ' 

Denote by 

Dt = {A(i,j):0<z,j<p}, Dt{i,j) 
the covariance matrix of the random vector q„ (t) . 
Lemma 4.10. // (2.4) holds, then 

a^j (t) = f "•'ai J , Q<i<i<p 



where atj are constants. 
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Proof. This is easily verified using mathematical induction or the scale transformation 
of Brownian motions. □ 

Lemma 4.11. If (2.4)-(2.6) hold, then, asn^oo, 

sup |nf,jD-;/^ -qJ(t)D7i/^| =Op(log-"/2 v/bil^). 

a{n)<t<n/2 

Proof. Recall (4.26) and note that the difference between the coefficients ak,i,j and 
aij{t) is that the first expressions are defined as Riemann sums which approximate the 
integrals defining the latter. It can, hence, be easily verified that 

\a^^j{t)~ak,t,j\ ^^fl 



sup ' = C 7 (A: oo) 



fc<t<fe+l Olk,ij 



The result now follows from Lemmas 4.8-4.10. □ 
Define 

(E[Joff^.t(2;) dW^i,„(x)]2)i/2 

where {VFi.„(s) : s > 0} is defined in (2.5) and 

gi^t{x) = - ai^of" ~ ai^if~^x au-itx^~^ . 

Note that the distribution of qri(0 docs not depend on n. Thus, we arrive at the following 
result. 



Lemma 4.12. // (24)-(2.6) hold, then. 



as n ^ oo, 



2.--.= sup C mn (t) + Op ( v/log log ?i[log-"^ n + log ) . 



Zn.2 



a{n)<t<b{n) 

Proof. This follows on combining the results of the foregoing lemmas. □ 

On summarizing what has been proven thus we conclude that, restricted to 1 < fc < 
n/2, the test statistic r„ introduced in (2.2) can be asymptotically characterized by the 
supremum (over a restricted range determined by a(n) and 6(n)) of the inner product 
q^(t)q„(t), where the vectors qn(i) have components given by the (normahzed) integrals 
Jq gi,t{x) dWi^n{x). Clearly, the functions gi^t{x) are polynomials of order i, whose leading 
coefficient (i.e., the coefficient assigned to the monomial x^) is 1 and, by construction, 
they possess the property gi_t{x)gj^t{x)(ix = if i ^ j. So, the functions gi.t{x) are 
exactly the Legendre polynomials on the interval [0, t] with normalized coefficient for the 
monomial This completes the proof for the range 1 < k < n/2. 
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In the remainder of the subsection, we consider max„/2<fc<)i Sl'C^^Sfe, adapting the 
arguments developed before. First, observe that 

n n 
i=l i=k+l 

Let 

c{n) =n Y — and d(n) = n — log" n. 

Along the lines of the proofs of Lemmas 4.1 and 4.2-4.7, one can establish the following 
orders of magnitude: 

max S^C-^Sfc = Op (log log log n) ; 

n/2<k<c{n) 

max S^C^T^Sfc = Op (log log log n). 

d{n)<.k<in—p 

Let 

n-l 

rl = {rk,o,...,rk,p), rk,^= ^ fej 

and R^^ = BnC'^^ Bn, with B„ defined in (4.14). As in Lemma 4.5, we obtain that 
r .5f<^.r ,SrC,7iSfc= max r^^R^T^rfc + Op(log-^/2 n). 

c(n)</i:<a(nj c(n)<k<a{n} 

Repeating the arguments that lead to the statements contained in Lemmas 4.6-4.12 here 
yields 

~2 , rfcR-fc^rfc= sup B^{t)'kt^e{t), 

a c(n)<k<d(n) c(n)<t<d(n) 

where, using the Brownian motion {VF2.,i(s) : s > 0} specified in assumption (2.6), 



e^(t) = (eo(i),...,ep(i)), e,[t) = j xMW2,„(x), (4.28) 

/n 
x'+' dx. 

If we finally define 

e^(t) = (eo(t),...,gp(i)), e,{t)^ I {n-xYdWi^x), 

Jo 

Rt = {i?t(i,j):0<i,j<ri, Rt{i,])^ I {n^xy+^dx, 

Jo 



658 



A. Aue et al. 



where {VF(s) : s > 0} denotes a standard Brownian motion, then 

sup e^(i)Rf"^e(t) = sup {t)'R';^e[t). 

c{n)<t<d{n) a(n)<t<b{n) 

The orthonormaHzation in e^(i)Rf starts with W{t) and the coordinates are uncor- 
related, the ith coordinate being the integral of an ith order polynomial with respect 
to {W^(s) :s > 0}. Hence, wc arc again on the restricted range determined by c{ti) and 
d{n), integrating the Lcgcndrc polynomials on [0, t] with respect to the Brownian motion 
{W{s):s>0}. 

The final lemma establishes that the suprcma taken over the first part. a{n) < k < b{n), 
and second part, c(n) < k < d(n), have the same distribution, but arc independent. 

Lemma 4.13. For each n, there are {e„(t), <t <n} ^ {^(i), <t <n} such that 

sup e^(t)Rt"^e„(i) = sup q^(i)q"(0> 

a{n)<t<b{7i) a{n)<t<b{n) 

where the first and second suprcma are independent. Moreover, as n — > oo, 

c{n)<k<d{n) a{n)<t<b{n) 

+ Opiilogn)-^'/'-) + Op(v/loglogn[log-"^ n + log"" n]). 
Proof. This is immediate. □ 



5. Proofs of Theorem 2.2 and Corollary 2.1 

The final section completes the proof of Theorem 2.2 and its corollary by combining 
the results obtained in the previous section with the extreme value theory for stochastic 
integrals of Legendre polynomials with respect to Brownian motions as developed in Aue 
et al. (2007). 

Proof of Theorem 2.2. Let 

c{n,x) ^ X + 2 log log n+ {p+1) log log log n — 2 log 

where T{t) denotes the Gamma function. If follows from Lemmas 4.3-4.13 that, as n — > oo, 

\P{Tn < c{n,x)} - P{max{e„4,e„,2} < c{n,x)}\ ^ 

for all X, where and ^„^2 are independent, identically distributed random variables 
satisfying 

Cn,i=6i,2= sup q^{t)qn{t). 

a(n)<t<b(n) 



/ 2tP+^)/-'r((p+l)/2) 

I 7+1 
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log log n — log log ft- (n) = log ( I + 7- 

and 

logloglogn — logloglog/i(7i) = log! 1 



Therein, the quantities Qnit) are the Gaussian processes introduced in Lemma 4.12. An 
application of the main result in Aue et al. (2007) implies that 

lim P{e„.»<cKa;)}-cxp(-e-"/2), i==l,2, 

n — >oo 

and, consequently, the assertion of Theorem 2.2 follows readily. □ 

Proof of Corollary 2.1. We need only observe that 

loglog7il"^\ 

^0 (n^oo) 

logn J / 

log log n — log log h{n) 
loglog/i(n) 

since loglog?i — loglogft(n) —>■ and \oglogh{n) -^00 as n 00. □ 
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